174 research outputs found

    MICheck: a web tool for fast checking of syntactic annotations of bacterial genomes

    Get PDF
    The annotation of newly sequenced bacterial genomes begins with running several automatic analysis methods, with major emphasis on the identification of protein-coding genes. DNA sequences are heterogeneous in local nucleotide composition and this leads sometimes to sequences being annotated as authentic genes when they are not protein-coding genes or are true but uncharacterized protein-coding genes. This first annotation step is generally followed by an expert manual annotation of the predicted genes. The genomic data (sequence and annotations) organized in an appropriate databank file format is subsequently submitted to an entry point of the International Nucleotide Sequence Database. These procedures are inevitably subject to mistakes, and this can lead to unintentional syntactic annotation errors being stored in public databanks. Here, we present a new web program, MICheck (MIcrobial genome Checker), that enables rapid verification of sets of annotated genes and frameshifts in previously published bacterial genomes. The web interface allows one easily to investigate the MICheck results, i.e. inaccurate or missed gene annotations: a graphical representation is drawn, in which the genomic context of a unique coding DNA sequence annotation or a predicted frameshift is given, using information on the coding potential (curves) and annotation of the neighbouring genes. We illustrate some capabilities of the MICheck site through the analysis of 20 bacterial genomes, 9 of which were selected for their ‘Reviewed’ status in the National Center for Biotechnology Information (NCBI) Reference Sequence Project (RefSeq). In the context of the numerous re-annotation projects for microbial genomes, this tool can be seen as a preliminary step before the functional re-annotation step to check quickly for missing or wrongly annotated genes. The MICheck website is accessible at the following address:

    Iterative reconstruction of a global metabolic model of Acinetobacter baylyi ADP1 using high-throughput growth phenotype and gene essentiality data

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Genome-scale metabolic models are powerful tools to study global properties of metabolic networks. They provide a way to integrate various types of biological information in a single framework, providing a structured representation of available knowledge on the metabolism of the respective species.</p> <p>Results</p> <p>We reconstructed a constraint-based metabolic model of <it>Acinetobacter baylyi </it>ADP1, a soil bacterium of interest for environmental and biotechnological applications with large-spectrum biodegradation capabilities. Following initial reconstruction from genome annotation and the literature, we iteratively refined the model by comparing its predictions with the results of large-scale experiments: (1) high-throughput growth phenotypes of the wild-type strain on 190 distinct environments, (2) genome-wide gene essentialities from a knockout mutant library, and (3) large-scale growth phenotypes of all mutant strains on 8 minimal media. Out of 1412 predictions, 1262 were initially consistent with our experimental observations. Inconsistencies were systematically examined, leading in 65 cases to model corrections. The predictions of the final version of the model, which included three rounds of refinements, are consistent with the experimental results for (1) 91% of the wild-type growth phenotypes, (2) 94% of the gene essentiality results, and (3) 94% of the mutant growth phenotypes. To facilitate the exploitation of the metabolic model, we provide a web interface allowing online predictions and visualization of results on metabolic maps.</p> <p>Conclusion</p> <p>The iterative reconstruction procedure led to significant model improvements, showing that genome-wide mutant phenotypes on several media can significantly facilitate the transition from genome annotation to a high-quality model.</p

    Evolutionary Evidence of Algal Polysaccharide Degradation Acquisition by Pseudoalteromonas carrageenovora 9T to Adapt to Macroalgal Niches

    Get PDF
    About half of seaweed biomass is composed of polysaccharides. Most of these complex polymers have a marked polyanionic character. For instance, the red algal cell wall is mainly composed of sulfated galactans, agars and carrageenans, while brown algae contain alginate and fucose-containing sulfated polysaccharides (FCSP) as cell wall polysaccharides. Some marine heterotrophic bacteria have developed abilities to grow on such macroalgal polysaccharides. This is the case of Pseudoalteromonas carrageenovora 9T (ATCC 43555T), a marine gammaproteobacterium isolated in 1955 and which was an early model organism for studying carrageenan catabolism. We present here the genomic analysis of P. carrageenovora. Its genome is composed of two chromosomes and of a large plasmid encompassing 109 protein-coding genes. P. carrageenovora possesses a diverse repertoire of carbohydrate-active enzymes (CAZymes), notably specific for the degradation of macroalgal polysaccharides (laminarin, alginate, FCSP, carrageenans). We confirm these predicted capacities by screening the growth of P. carrageenovora with a large collection of carbohydrates. Most of these CAZyme genes constitute clusters located either in the large chromosome or in the small one. Unexpectedly, all the carrageenan catabolism-related genes are found in the plasmid, suggesting that P. carrageenovora acquired its hallmark capacity for carrageenan degradation by horizontal gene transfer (HGT). Whereas P. carrageenovora is able to use lambda-carrageenan as a sole carbon source, genomic and physiological analyses demonstrate that its catabolic pathway for kappa- and iota-carrageenan is incomplete. This is due to the absence of the recently discovered 3,6-anhydro-D-galactosidase genes (GH127 and GH129 families). A genomic comparison with 52 Pseudoalteromonas strains confirms that carrageenan catabolism has been recently acquired only in a few species. Even though the loci for cellulose biosynthesis and alginate utilization are located on the chromosomes, they were also horizontally acquired. However, these HGTs occurred earlier in the evolution of the Pseudoalteromonas genus, the cellulose- and alginate-related loci being essentially present in one large, late-diverging clade (LDC). Altogether, the capacities to degrade cell wall polysaccharides from macroalgae are not ancestral in the Pseudoalteromonas genus. Such catabolism in P. carrageenovora resulted from a succession of HGTs, likely allowing an adaptation to the life on the macroalgal surface

    Comparative Genomics of Multidrug Resistance in Acinetobacter baumannii

    Get PDF
    Acinetobacter baumannii is a species of nonfermentative gram-negative bacteria commonly found in water and soil. This organism was susceptible to most antibiotics in the 1970s. It has now become a major cause of hospital-acquired infections worldwide due to its remarkable propensity to rapidly acquire resistance determinants to a wide range of antibacterial agents. Here we use a comparative genomic approach to identify the complete repertoire of resistance genes exhibited by the multidrug-resistant A. baumannii strain AYE, which is epidemic in France, as well as to investigate the mechanisms of their acquisition by comparison with the fully susceptible A. baumannii strain SDF, which is associated with human body lice. The assembly of the whole shotgun genome sequences of the strains AYE and SDF gave an estimated size of 3.9 and 3.2 Mb, respectively. A. baumannii strain AYE exhibits an 86-kb genomic region termed a resistance island—the largest identified to date—in which 45 resistance genes are clustered. At the homologous location, the SDF strain exhibits a 20 kb-genomic island flanked by transposases but devoid of resistance markers. Such a switching genomic structure might be a hotspot that could explain the rapid acquisition of resistance markers under antimicrobial pressure. Sequence similarity and phylogenetic analyses confirm that most of the resistance genes found in the A. baumannii strain AYE have been recently acquired from bacteria of the genera Pseudomonas, Salmonella, or Escherichia. This study also resulted in the discovery of 19 new putative resistance genes. Whole-genome sequencing appears to be a fast and efficient approach to the exhaustive identification of resistance genes in epidemic infectious agents of clinical significance

    A Novel Acyl-CoA Beta-Transaminase Characterized from a Metagenome

    Get PDF
    BACKGROUND: Bacteria are key components in all ecosystems. However, our knowledge of bacterial metabolism is based solely on the study of cultivated organisms which represent just a tiny fraction of microbial diversity. To access new enzymatic reactions and new or alternative pathways, we investigated bacterial metabolism through analyses of uncultivated bacterial consortia. METHODOLOGY/PRINCIPAL FINDINGS: We applied the gene context approach to assembled sequences of the metagenome of the anaerobic digester of a municipal wastewater treatment plant, and identified a new gene which may participate in an alternative pathway of lysine fermentation. CONCLUSIONS: We characterized a novel, unique aminotransferase that acts exclusively on Coenzyme A (CoA) esters, and proposed a variant route for lysine fermentation. Results suggest that most of the lysine fermenting organisms use this new pathway in the digester. Its presence in organisms representative of two distinct bacterial divisions indicate that it may also be present in other organisms

    The CanOE Strategy: Integrating Genomic and Metabolic Contexts across Multiple Prokaryote Genomes to Find Candidate Genes for Orphan Enzymes

    Get PDF
    Of all biochemically characterized metabolic reactions formalized by the IUBMB, over one out of four have yet to be associated with a nucleic or protein sequence, i.e. are sequence-orphan enzymatic activities. Few bioinformatics annotation tools are able to propose candidate genes for such activities by exploiting context-dependent rather than sequence-dependent data, and none are readily accessible and propose result integration across multiple genomes. Here, we present CanOE (Candidate genes for Orphan Enzymes), a four-step bioinformatics strategy that proposes ranked candidate genes for sequence-orphan enzymatic activities (or orphan enzymes for short). The first step locates “genomic metabolons”, i.e. groups of co-localized genes coding proteins catalyzing reactions linked by shared metabolites, in one genome at a time. These metabolons can be particularly helpful for aiding bioanalysts to visualize relevant metabolic data. In the second step, they are used to generate candidate associations between un-annotated genes and gene-less reactions. The third step integrates these gene-reaction associations over several genomes using gene families, and summarizes the strength of family-reaction associations by several scores. In the final step, these scores are used to rank members of gene families which are proposed for metabolic reactions. These associations are of particular interest when the metabolic reaction is a sequence-orphan enzymatic activity. Our strategy found over 60,000 genomic metabolons in more than 1,000 prokaryote organisms from the MicroScope platform, generating candidate genes for many metabolic reactions, of which more than 70 distinct orphan reactions. A computational validation of the approach is discussed. Finally, we present a case study on the anaerobic allantoin degradation pathway in Escherichia coli K-12

    A complete collection of single-gene deletion mutants of Acinetobacter baylyi ADP1

    Get PDF
    We have constructed a collection of single-gene deletion mutants for all dispensable genes of the soil bacterium Acinetobacter baylyi ADP1. A total of 2594 deletion mutants were obtained, whereas 499 (16%) were not, and are therefore candidate essential genes for life on minimal medium. This essentiality data set is 88% consistent with the Escherichia coli data set inferred from the Keio mutant collection profiled for growth on minimal medium, while 80% of the orthologous genes described as essential in Pseudomonas aeruginosa are also essential in ADP1. Several strategies were undertaken to investigate ADP1 metabolism by (1) searching for discrepancies between our essentiality data and current metabolic knowledge, (2) comparing this essentiality data set to those from other organisms, (3) systematic phenotyping of the mutant collection on a variety of carbon sources (quinate, 2-3 butanediol, glucose, etc.). This collection provides a new resource for the study of gene function by forward and reverse genetic approaches and constitutes a robust experimental data source for systems biology approaches

    Complete genome sequence of the entomopathogenic and metabolically versatile soil bacterium Pseudomonas entomophila

    Get PDF
    Pseudomonas entomophila is an entomopathogenic bacterium that, upon ingestion, kills Drosophila melanogaster as well as insects from different orders. The complete sequence of the 5.9-Mb genome was determined and compared to the sequenced genomes of four Pseudomonas species. P. entomophila possesses most of the catabolic genes of the closely related strain P. putida KT2440, revealing its metabolically versatile properties and its soil lifestyle. Several features that probably contribute to its entomopathogenic properties were disclosed. Unexpectedly for an animal pathogen, P. entomophila is devoid of a type III secretion system and associated toxins but rather relies on a number of potential virulence factors such as insecticidal toxins, proteases, putative hemolysins, hydrogen cyanide and novel secondary metabolites to infect and kill insects. Genome-wide random mutagenesis revealed the major role of the two-component system GacS/GacA that regulates most of the potential virulence factors identified
    • 

    corecore